ABRIR at NTCIR-9 GeoTime Task Usage of Wikipedia and GeoNames for Handling Named Entity Information
نویسنده
چکیده
ABRIR at NTCIR-8 Approach C t ti f N d E tit d t b Construction of Boolean query by using pseudo relevant documents Named entity is more important than others Verb synonym list is used for increasing coverage Combination of Boolean IR model and probabilistic IR model (Okapi) Penalty is applied for documents that don’t satisfy Boolean query Named entity is more important than others Failure analysis at NTCIR-8 ons ruc on o ame n y a a ase Usage of Linked Open Data GeoNames : Large-scale database of geographic entities Wikipedia: Information resource for named entity Database construction Geographic entities from GeoNames and Wikipedia • Add Japanese translation of GeoNames entry by using Wikipedia information [Takenaka, et. al., 2011] Other named entities from Wikipedia • Usage of DBPedia category information to identfy type of named entity (Person, Organization Place and Infrastructure) for making a list Identification of named entities and relationships between these entities and the query is important 「アフリカ(Africa)」⇔「コンゴ民主共和国(Democratic Republic of the Congo)」 Quality of pseudo relevant documents Topic drift Verb synonym expansion Some verbs are not important for Boolean query construction
منابع مشابه
NTCIR-9 GeoTime at Osaka Kyoiku University - Toward Automatic Extraction of Place/Time Terms -
Our approach to NTCIR-9 Geotime was to obtain place/time information about topics from Wikipedia and Google using query terms extracted from topics. Adding this information to query terms, we retrieved documents using tag index and scored them. In addition, we compared tag of searched documents with time information, weighted the score value of documents retrieved, and ranked them...
متن کاملNTCIR9-GeoTime Overview - Evaluating Geographic and Temporal Search: Round 2
ABSTRACT GeoTime for the NTCIR Workshop 9 is the second evaluation of Geographic and Temporal Information Retrieval called “NTCIR GeoTime”. The focus of this task is on search with Geographic and Temporal constraints. This overview describes the data collections (Japanese and English news stories), topic development, assessment results and lessons learned from this second NTCIR GeoTime task, wh...
متن کاملUniversity of Alicante at NTCIR-9 GeoTime
In this paper we present a complete system for the treatment of the geographical dimension in the text and its application to information retrieval. This system has been evaluated in the GeoTime task of the 9th NTCIR workshop, making it possible to compare the system with other current approaches to the topic. In order to participate in this task we have added to our GIR system the temporal dim...
متن کاملNTCIR-8 GeoTime at Osaka Kyoiku University: Hierarchical Index for Geographic Retrieval
We retrieved topics that contained the geographic and temporal information at NTCIR-8 GeoTime task. Employing morphological analysis, temporal and geographic information are extracted from GeoTime collection. The index that represents a geographic hierarchy is made from the geographic information. In the experiment, we confirmed that the effect of the geographic hierarchical index when topics i...
متن کاملSINAI at NTCIR-9 GeoTime: a filtering and reranking approach based solely on geographical entities
Geographic Information Retrieval (GIR) is an active and growing research area that focuses on the retrieval of textual documents according to a geographical criteria of relevance. In recent years, the IR research community has paid particular attention in IR systems that take into account temporal constraints. Temporal Information Retrieval (TIR) is a recent field which addresses the combinatio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011